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Abstract. I review some of the past and current methods for retrieval 
of literature and other published information, excluding commercial ser¬ 
vices. Much of this is a personal view and based on experience made at 
various institutions, some of them neither with an adequately complete 
astronomy library, nor with a professional astronomy librarian. Rather 
than describing current retrieval methods, a few of their weaknesses are 
identified which merit future improval. Despite the availability of pow¬ 
erful electronic tools, we need to improve efforts in safeguarding pub¬ 
lished numerical and textual information in a format readily usable by 
astronomers. Comments are made on a user-friendly arrangement of a 
library, and on useful tasks for librarians with available time. 


1. Diversity of Libraries and Information Systems: A Personal View 

In this section I describe some of the experience relevant to the subject, which 
I have made at various astronomy institutions over the last 20 years. 

At Max-Planck Institut fiir Radioastronomie Bonn, Germany (1976-86) 
I became used to both a good library (complemented by the Astronomische 
Institute of University of Bonn, next door) and an excellent librarian (of the pre- 
electronic information age). In the early 1980s, the Fachinformationszentrum 
(FIZ) Karlsruhe started to offer a service providing a monthly extract of the 
latest literature on a user-specified combination of keywords (a “user profile”), 
which at that time was free of charge for users at MPIfR. However, due to 
the comprehensive collection of journals and preprints received at the MPIfR 
library in the field of radio astronomy, there was hardly any article listed by FIZ 
that I had not seen before. It was mainly the biannual volumes of Astronomy 
& Astrophysics Abstracts (AAA) which always offered some new references to 
me, although with the usual delay of about eight months after the end of the 
nominal half-year period covered by each edition of AAA. During this “pre- 
Internet” age, any exchange of bulk data or information was much slower and 
more tedious than today. I recall that the computing center provided about two 
major astronomical catalogs (to be interrogated via batch jobs). Requesting 
additional catalogs implied long delays, exchange of tapes, or the conversion of 
formats, to be compatible with local hard- and software. 

In 1987/8, at the Inst. Argentine de Radioastronomfa, facilities were pro¬ 
foundly different due to economic limitations. The collection of journals was 
limited to the very core journals, which arrived irregularly, and often with sev¬ 
eral months of delay. However, during a visit to Cordoba (Argentina) I enjoyed 
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the astronomy library when working on a project that required browsing many 
years of the core journals in astronomy. Tables and chairs in almost every cor¬ 
ridor of the library allowed one to study the volumes right next to the shelves. 

I entered the Internet age (then “BITnet”) when working at the Astronomy 
Deptartment of Instituto de Pesquisas Espaciais (INPE, Brazil), which had a 
decent collection of astronomical journals but was rather poor in preprints. By 
access to electronic mail, from 1989 on, I came across the listing of preprints 
received by the Space Telescope Science Institute (STScI) and distributed as the 
“STEP sheet” every two weeks by its librarian. Due to this unique service, I 
felt again up-to-date in astronomical literature, and together with the electronic 
mail directory of astronomers (the RGO Email Guide, by C. Benn &: R. Martin), 
it allowed me to obtain reprints or preprints from the authors rather efficiently. 
It is amazing that until today I manage to surprise some colleagues when telling 
them about the STEP sheet of which they had not heard before. However, for 
older and rarer literature, I used to visit the library of Inst. Astron. e Geofisico 
(lAG, Sao Paulo) with its exceptional collection of journals and observatory 
reports from all over the world. It was there that I discovered the usefulness of 
the journal Gurrent Gontents of Physical, Ghemical & Earth Sciences to browse 
the contents pages of recent issues of virtually all relevant journals in astronomy. 
Email allowed me to get hold of an article of interest usually within weeks. In 
some cases librarians of large astronomical institutions were kind enough to send 
me copies of articles I did not have access to, usually upon my request via email. 

Exchange of electronic copies of articles via email was still in its infancy 
in 1990 both due to network limitations and format incompatibilities. When 
I received the first email containing an article written in TeX, I had to carry 
it on a diskette to a friend who happened to have TgX installed on his PG. 
Within less than a year of its first use, email had turned indispensable for me 
to maintain collaborations and be alerted about the latest preprints. It allowed 
me to start an international campaign to safeguard tabular data from published 
journals in electronic form, which soon led to the most complete collection of ra¬ 
dio source catalogs in existence (§4^), and email led me to find an opportunity 
to continue my research at Instituto de Astroffsica de Ganarias, (lAG, Tenerife, 
Spain) in 1991. lAG possesses the biggest modern astronomy library in Spain, 
with a large variety of astronomical journals including physics, electronics and 
computer sciences, and volumes of recent proceedings in astrophysics. During 
my two years at I AG, none of my many suggestions for purchasing books were 
rejected. I AG also had a large collection of astronomical newsletters providing 
material for a review article on “Network Resources for Astronomers” (Ander- 
nach et al. 1994). In the spring of 1993 the World Wide Web (WWW) had been 
created by researchers at GERN, and it was only due to the up-to-date level of 
software maintained at lAG, that I was able to access the information (offered 
only on WWW) about the ADASS-III meeting I was going to attend. 

In 1992 the Los Alamos National Laboratory (LANL, USA) and the In¬ 
ternational School for Advanced Studies (SISSA, Trieste, Italy) started to keep 
mirror archives of electronically submitted preprints. Initially these were limited 
to the fields of theoretical cosmology and particle physics, and were not known 
to many astronomers (including myself), but have now become impressively 
popular for the deposit and circulation of astronomy preprints (§4.1.). 
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At the Observatoire de Lyon (1993/94), I enjoyed its excellent collection 
of astronomical journals, many of them available back to the very first volume. 
What I did not recall from the other libraries was the full collection of As- 
tronomischer Jahresbericht, the predecessor of AAA from 1899 through 1968. 
Despite this I recall the disadvantage that the main room with journals and 
textbooks had no table or chair where one could study (and not even space for 
it) and the next decent copy machine was two floors up. Observatoire de Lyon 
also had a good viewing facility for the Palomar and ESO/SERC Optical Sky 
Survey. Much of this can now be done with the Digitized Sky Survey on WWW, 
but the original survey plates, prints or films still offer a finer detail e.g. for 
morphological classification of galaxies and should by all means be preserved. 

In 1995/96, working at the lUE Observatory in Spain, I saw the advent of 
electronic journals like ApJ Letters, soon followed by the New Astronomy and 
the ApJ. I was surprised to find that some large data tables in MNRAS were still 
published on microfiche, in fact as late as early 1997, at a time when microfiche 
copiers had practically disappeared from astronomy libraries. When trying to 
get hardcopies of several data sets published on microfiche over the last 20 years 
I found that STScI was among the few institutions that still had such a machine 
available! For how much longer will these machines exist ? 

Since 1996, I have worked in a small but growing Astronomy Department at 
Guanajuato University, Mexico. Although various donations of sets of journals 
had been collected by the founders of the Department before my arrival, I was 
surprised that I was the first person to show an active interest in putting all these 
donations together, as soon as we found space for them. Apparently, NASA’s 
Astrophysics Data System (ADS) article service could already satisfy most of 
the needs of our department members, and it required a person of “special” 
interests to go after the needs of the library. While this may show a (perhaps 
temporal?) tendency (or even a need) for modern researchers to survive without 
a library, I continue my contacts to obtain further donations to complete our 
physical library, and wish to take the opportunity to thank several members 
of the audience who have donated material to our department. In my current 
position, I am also responsible for the library for the first time, and admit that 
I do not fulfil many of the requirements proposed further below. 

Details and URLs on many more information resources in astronomy can 
be found in Andernach (1998). In the URLs quoted here I omit the 


2. The Physical Library 

The following list of requirements for the functionality of a physical library may 
seem all too obvious, but as I do not often find them, I mention them anyway. 

The library should be a study room rather than a storage room, and should 
invite one to look at more than just that piece of literature one is searching 
for. Tables and chairs should be present in various corners to allow one to read 
and work next to the material found. Obviously a copy machine should be 
reasonably close. The item searched for should be easily locatable at any time, 
and without the presence of the librarian. Ideally, the library catalog should 
be searchable in electronic form, accessible from any user’s desktop (including 
one in the library itself), e.g. via the WWW. The search program should be 
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self-explanatory, and the code for the physical location of a certain book or 
item should be clearly marked on the shelves. The current location of books 
on loan should be made known to the interested user, either by replacing the 
book with a card containing the name of the user who lent the book, or by 
indicating this user in the catalog record. A publicly accessible ASCII hie with 
the library catalog, if not too large, is much better than nothing. Access to the 
library should be possible at all times, including nights and weekends, perhaps 
via a key or access code outside normal working hours. The reaction to books 
disappearing should NOT be access restriction, but “razzias” through the staff 
offices, and strict measures should be applied to users who do not adhere to the 
rules. 

For very literature-demanding research a computer terminal in the library 
would allow users to interact directly with their files. 

In addition to the shelf with the most recent issues of journals, the re¬ 
cently acquired books should also be displayed separately for at least a couple 
of months. Alternatively a listing of new acquisitions could be distributed to 
the users on a regular basis. Newsletters and other “grey” literature should be 
archived and made available to users if possible. Duplicate items should be kept 
and a list be posted to networks of librarians. 

3. Additional Services the Librarian conld provide 

Over the last decade about a handful of astronomy librarians have provided ser¬ 
vices of relevance to the entire astronomical community. Examples are the STEP 
sheet (sesame.stsci.edu/lib/stsci-preprint-db.html), the Astronomy Thesaurus 
(msowww.anu.edu.au/hbrary/thesaurus/), the list of astronomy meetings 
(cadcwww.dao.nrc.ca/meetings/meetings_without.html), a database of book re¬ 
views (www.astro.utoronto.ca/reviewsl.html), and the directory of astronomy 
libraries (www.eso.org/libraries/astro-addresses.html). These very successful, 
though time-consuming projects will hopefully stimulate further librarians to 
take up projects of use to a larger community. To avoid duplication, these ac¬ 
tivities should be coordinated with the established networks of librarians. Apart 
from such outstanding efforts like the above, there are services librarians could 
provide to the local users, if their time allowed. I describe a few here. 

Not many libraries can afford to subscribe to virtually all astronomy jour¬ 
nals. The exchange of paper preprints seems to be decreasing in favour of elec¬ 
tronic circulation. This means that a large fraction of current astronomical liter¬ 
ature has to be searched for at other places. The contents pages of an increasing 
number of astronomy journals are accessible in one way or another, and could 
be displayed in the library together with the other journals. A more expensive 
method is to subscribe to Current Contents which offers the contents pages of 
practically all astronomy journals. An alternative solution is to regularly consult 
the URLs of journals offering their contents pages on WWW and prepare print¬ 
outs. The UnCover database (uncweb.carl.org) includes most astronomy-related 
journals and allows one to browse their contents by issue. Listings of preprints 
submitted to the LANL/SISSA preprint server (xxx.lanl.gov) could also be dis¬ 
played at a suitable frequency (bi-weekly or monthly). Preprint lists compiled 
by the librarians of NRAO and STScI (the “RAP” and “STEP” sheets) would 
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complete the collection. While many researchers are already making use of the 
latter facilities individually, the librarian could motivate other astronomers who 
are either unfamiliar with these services or pretend to have no time to consult 
them. Since the librarian is frequently responsible for the publication list of 
an institute, these publications could be listed and offered electronically from 
the library web page. To guarantee widespread notification of these preprints 
authors would ideally deliver their preprints to the LANL/SISSA server and the 
library page would simply have a pointer to that location. 

The librarian should be accessible by electronic mail and make regular use 
of it, e.g. to subscribe to relevant distribution lists (Astrolib, EGAL, PAMnet), 
to alert colleague librarians about missing items, or to organize the exchange 
of duplicate material between libraries. Equally important for librarians will 
be the regular study of relevant information available on the Internet, e.g. on 
journals, publishers, subscriptions, societies, other libraries, etc. 

4. Electronic Journals and Preprints 

The appearance of more and more journals in electronic form has stimulated 
an intense discussion about the problem of how to safeguard their content for 
future generations (e.g. Grothkopf 1997). I shall not enter this discussion here 
but only mention a few practical problems I encountered while using them. 

4.1. The Version Problem for Preprints 

The problem of different versions of papers being circulated has been with us 
since the age of paper-only preprints. Front-line researchers are obliged to 
take much of their latest knowledge from preprints received from colleagues or 
found in the library. Most likely any changes between preprint and the actually 
published version will remain unnoticed by the researcher quoting the original 
preprint. In some cases, the part of the text referred to may have been altered 
or even have disappeared, and in rarer cases the referred paper may never be 
published because the preprint submitted was never accepted. With the advent 
of the LANL/SISSA server of preprints this situation has not changed. At least 
it is now possible to refer to a unique reference on the LANL server, and to 
retrieve a paper for the foreseeable future. In fact, the reference section of pub¬ 
lished papers contain an increasing number of pointers to astro-ph/YYMMNNN. 
The authors may change or even withdraw a submitted paper, and future re¬ 
trievals of the paper will report about this history. However, there are quite 
a few preprints on these servers, for which the authors do not state where the 
paper is supposed to appear and whether the deposited version is a submitted or 
an accepted one. In December 1997, the journal New Astronomy suggested that 
if authors wish to deposit their papers for this journal at an electronic preprint 
server prior to publication, they should not deposit the authentic accepted ver¬ 
sion. Glearly, this is not the kind of “generosity” astronomers look for, and it 
unnecessarily increases the confusion between preprint and published versions. 

4.2. Ready Access to Published Data 

The importance of a central archive for astronomical data of general interest like 
object catalogs, survey images, spectra or time series had led to the creation of 


5 



data centers like CDS and NASA-ADC in the early 1970s. With the subsequent 
development of databases like SIMBAD, NED, LEDA, etc., there was a growing 
need for electronic versions of all kinds of data sets in the published literature, 
in order to attach these data (or at least their bibliographic reference) to the 
respective object, so that a query on this object would return the article or 
even the measurement made on it. Thus the collection activity of data centers 
has gradually been expanded from larger general-purpose catalogs to smaller 
and more specialised tables in the published literature. Unfortunately during 
the past decade, many chances have been lost to archive both data tables and 
full-text versions of articles in electronic form. These obviously existed for the 
large majority of publications since the mid-1980s, but were not archived for 
future use. Only with the advent of the truely electronic editions some limited 
full-text search engines have now become available, like that for the Electronic 
ApJ (www.journals.uchicago.edu/ApJ). It is reassuring to see that the ADS 
article service (adsabs.harvard.edu/article_service.html) is approaching its goal 
of providing free access to scanned images of all major astronomy journals back 
to their volume 1. However, the conversion of these images to searchable text 
via Optical Character Recognition software (OCR) appears unrealistic. 

The recovery of the data sections of articles not previously available at data 
centers in electronic form appeared more feasible and urgent to me, and since 
1990 I have collected such data tables and catalogs in the area of extragalactic 
objects and radio sources. The radio data constitute the basis for the presently 
most complete Internet service for radio sources (“CATS”; cats.sao.ru). In the 
early 1990s, I received most tables upon request to the individual authors, while 
more recently an increasing fraction of such data sets may be obtained from the 
LANL/SISSA preprint server (risking that these versions may not be identical 
to the published ones). A substantial fraction of these tables, usually older ones, 
was recovered via a scanner and OCR, followed by a strict proof-reading of the 
result before being released to other users, databases or data centers. Many 
tables were printed in such a small font or poor quality that even recovery with 
OCR is impossible. Currently the collection contains data sets from nearly 900 
articles, of which only ~25% are also available from established data centers such 
as CDS and NASA-ADC. About 15-20 data sets are being added per month, 
and my activity has turned into that of an independent data center, proving that 
the capacity of established data centers is not sufficient to cope with the number 
of data sets published. Not surprisingly, the effort to obtain these data sets from 
the authors is being duplicated by managers of databases mentioned above and 
by other users interested in individual data sets. A better coordination between 
data centers, publishers and authors/users is badly needed. 

A good example has been set in 1994 with the agreement between the 
publishers of A&A and A&AS and CDS to archive data tables published in these 
journals at CDS. During the last four years, a similar procedure was followed by 
the journals ApJ, ApJS, AJ and RASP who published larger data sets separately 
on CD-ROM, although at a low level of completeness and with a delay implied 
by the CDs being published only twice a year. Since 1998, the publication of 
these CDs has been abandoned and the user is supposed to download the data 
from the electronic versions of these journals, carrying with it two drawbacks: 
users must subscribe to the electronic version, and the downloaded data are not 
always in a format directly usable. A few tests I made at institutions subscribed 
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to electronic journals were not too promising. Downloading a table from the 
electronic ApJ as “text” results in a file with entries separated by blank lines, 
with columns not well justified and some HTML symbols left over. The AJ 
actually offers hyperlinks to ASCII versions of its tables, but I found them to 
contain TAB symbols requiring some editing to align the columns. For papers 
whose data tables were published on the AAS CD-ROM series, the electronic 
article showed only the sample page of the table (as the printed version) and did 
not offer a link to the full files as on CD-ROM, which is an unnecessary limitation 
given the small file sizes involved. The electronic A&A and A&AS offer links 
to the CDS electronic archive for the larger tables, usually those in A&AS. 
However, smaller tables mixed within the text are often inserted as images and 
papers are not retrievable in ASCII format at all. MNRAS presently offers only 
a few sample articles in electronic form and has not foreseen any archiving means 
for tabular data it has published. 

While electronic journals have made it possible to view articles much more 
timely than previously, they can still be improved to allow working with the 
data they contain. After having converted hundreds of tables from T[ii^ format 
to ASCII, I wonder whether we really need this laborious chain of conversions 
which starts with the marking up of the original author’s ASCII table into TgX 
for the preprint, and is followed by a publication in HTML. Later, an interested 
user downloads the paper in PDF or PS format and eventually needs a document 
converter (I am not aware of reliable ones) to recover the original ASCII table. 
Clearly, the journal publishers are mainly interested in selling their journals and 
will not be inclined to collaborate in this effort. A possible solution may be to 
oblige the authors, as a condition for publication, to provide their data to the 
established data centers, including a comprehensive documentation. 

5. Bibliographical Services 

Until about 1993, browsing the AAA was about the only means for bibliographic 
searches “without charge” (except for the cost of the books). In 1993 the ADS 
abstract service (§|^) became accessible via Internet inside USA and was released 
on WWW for free access world-wide early in 1994. It is by far the most popular 
bibliographic service in astronomy (Eichhorn 1997). The latter is mainly due 
to its powerful search tools through well over a million abstracts in astronomy, 
instrumentation, physics and geophysics, covering large parts of the literature 
back to the mid-1970s. 

However, when one needs “grey literature”, proceedings, etc., or one wants 
to measure the productivity of researchers, either from their publication records 
or from citations to their papers, the ADS, and even commercial services become 
incomplete, especially for proceedings papers. 

In reaction to the recent threat that AAA may stop publication in 1999 
(www.eso.org/libraries/iau97/hbreport.html), some astronomy librarians per¬ 
formed a comparison study and found that, in particular, information about con¬ 
ference proceedings and observatory reports found in AAA is missing from ADS 
and even from the commercial service INSPEC. For many years, astronomers 
have been waiting for the AAA to become available in electronic form. Only a 
few weeks before LISA HI the Astronomisches Rechen-Institut (ARI) announced 
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ARIBIB, a database of references from AAA since 1983 (Demleitner et al., these 
proceedings). ARI plans to prepare in electronic form the entire content of AAA, 
back to 1899 from Astronomischer Jahreshericht. This will be an invaluable tool 
at least for those astronomers working at institutions subscribed to the printed 
AAA. Hopefully these privileges can be levelled out in future via funding orga¬ 
nizations, perhaps in a similar way as has gradually been achieved in the past 
to provide free access to SIMBAD. 

Drawbacks of most bibliograhical services are their incompleteness and large 
delays in the inclusion of conference proceedings. The ADS now makes an effort 
to include references to conference papers even before they go into print. It is 
desirable that such papers may be retrieved by the names of the editors or the 
conference title, and be linked to the full table of contents of the volume. 

6. Conclusions 

I have tried to identify some areas in astronomy information systems and archiv¬ 
ing methods that merit future improvement. In astronomy, the evolution of 
these systems trace a delicate path between economic interests of publishers 
and activities of individual researchers, librarians, or funding organizations to 
create non-profit systems for the astronomy community. The voluntary effort 
of a handful of astronomy librarians has provided important contributions to 
general information services, stimulated by efficient and instantaneous means of 
communication (Email) and “publication” (WWW). I hope that conferences like 
the present one will stimulate further concerted activities of that sort. It is im¬ 
portant to maintain an active communication between publishers, astronomers, 
and librarians to guarantee that new products like electronic journals continue 
to serve their consumers well. 
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